Adaptation of HMMS in the presence of additive and convolutional noise
نویسنده
چکیده
The performance of speech recognizers deteriorates in case of a mismatch between the conditions during training and recognition. One difference is the presence of a stationary background noise during recognition which is also referred to as “additive” noise. Furthermore the recognition is influenced by the frequency response of the whole transmission channel from the speaker to the audio input of the recognizer. The term “convolutional” noise has been introduced for this type of distortion. Several approaches are known to compensate these effects individually or both together [1]-[4]. This paper describes an approach which compensates both types of noise. The scheme is based on an estimation of the noise spectrum [SI. Furthermore the frequency response is iteratively estimated by using the alignment information of the best path in the Viterbi algorithm. The comparison between the spectra of the input signal and the spectra of the corresponding HMM (Hidden Markov Model) states is taken as basis for the filter estimation. The estimated additive and convolutional noise components are used as input to the well known Parallel Model Combination (PMC) approach [6] to adapt the whole word HMMs of a speaker independent connected word recognizer. Considerable improvements can be achieved in the presence of just one type of noise as well as in the presence of both types together.
منابع مشابه
Rapid response and robust speech recognition by preliminary model adaptation for additive and convolutional noise
Users require speech recognition systems that offer rapid response and robustness (high accuracy). Speech recognition accuracy suffers from additive noise, imposed by ambient noise, and convolutional noise, created by space transfer characteristics. Existing model adaptation techniques achieve robustness by using HMM-composition and CMN (cepstral mean normalization). Since they need the additiv...
متن کاملA Convolutional Neural Network based on Adaptive Pooling for Classification of Noisy Images
Convolutional neural network is one of the effective methods for classifying images that performs learning using convolutional, pooling and fully-connected layers. All kinds of noise disrupt the operation of this network. Noise images reduce classification accuracy and increase convolutional neural network training time. Noise is an unwanted signal that destroys the original signal. Noise chang...
متن کاملCapacity Bounds and High-SNR Capacity of the Additive Exponential Noise Channel With Additive Exponential Interference
Communication in the presence of a priori known interference at the encoder has gained great interest because of its many practical applications. In this paper, additive exponential noise channel with additive exponential interference (AENC-AEI) known non-causally at the transmitter is introduced as a new variant of such communication scenarios. First, it is shown that the additive Gaussian ch...
متن کاملNoise and room acoustics distorted speech recognition by HMM composition
This paper presents a robust speech recognition method based on the HMM composition for the noisy room acoustics distorted speech. The method realizes an improved user interface such as the user is not encumbered by microphone equipments. The proposed HMM composition is obtained by naturally extending the HMM composition method of an additive noise to that of the convolutional room acoustics di...
متن کاملتجزیه و تحلیل عملکرد کدهای کانولوشنال تعریف شده در میدان حقیقی در کانال فیدینگ و در حضور نویز کوانتیزاسیون
Channel coding is one of the techniques used for suppression of fading and channel noise effects. Most of the researches on channel codes have been focused on the codes which are defined on Galois fields. Since two decades ago, using channel codes defined on Real and Complex fields has been considered for joint sourcechannel coding in multimedia transmission; but most of the works have focused ...
متن کامل